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(54) Targeted delivery through a cationic amino acid transporter 

(57) The invention relates to the targeted delivery of 
substances to cells. The invention provides a virus-like 
particle or gene delivery vehicle provided with a ligand 
capable of binding to a hunian amino acid transporter. 
Provided are for example ligands tiiat can bind to the 
human transporter of cationic L-amino acids (hCAT1). 
Such hCATI binding molecules find applications in the 
design of vector systems for entry into human or pri- 
mate cells. Preferred are retroviral envelope molecules, 
which - when incorporated in a virus particle - can infect 
hCATI positive cells at high frequencies. Also within the 
scope of tfie invention are methods for the design of 
such hCATI blndng molecules. 
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Description 

[0001] The invention relates to the targeted delivery of substances to cells. 

[0002] Delivery of substances to cells allows specific treatnrtent of said cells with compounds that act in the targeted 
ce^ . For example, tumour cells, when targeted with toxic components, selectively die when said toxin is delivered to said 
cell. Yet other cells, when provided with a gene lacking in said cell, can be restored in their function, so-called gene ther- 
apy. 

10003] Delivery oi a compound to a cells preferably occurs with a vehicle or particle that effectively k>rings the com- 
pound to the desired cell or cells and than delivers said compound into that cell (in vivo or in vitro) where it can exert its 
acbon. For this purpose, for example particles such as virus-like particles are suited. These particles, often derived from 
Known viruses, such as retrovirus or adenovirus, are small enough to penetrate in-between tissues and cells and arrive 
at a ceH of dioice where it for example can fuse with said cell and deliver its compound. Said virus-like particles may or 
may not be infectious in ttiemselves. their main concern is ttie targeted delivery of the compound of interest such as a 
gene, a toxin or immunostimulating components such as antigens. 

[0004] Yet other examples are gene-delivery vehicles, specifically designed to bansfer a gene to a cell of interest 
V^rus- Ike particles capable of delivering a gene are examples of said gen6<lelivery vehicles, however, also other exam- 
ples of such vehicles, of non-viral origin, such as liposomes or microbodies. or even latex particles, are known Vehicles 
such as liposomes or miwobodies can of course also carry other compounds than a gene, in particular toxic or immu- 
nostimulating components such as antigens can be included in such a vehicle. 

[0005] These vehicles or particles all have in common ttiat they need to be provided with a molecule or fragment 
thereof (ligand) capable of binding with said targeted cell, allowing targeting of said particle or vehicles to cells T^ere 
IS a need for speafic or broadly applicable ligands that react with cell-surface receptors on cells. In particular tinere is a 
need Ibr ligands that react with cell-surfece receptors after which efficient transfer of said compound to said cell such 
as a gene, is possible. Especially in human medicine, such a ligand would enable better appltoation of gene-transfer 
25 therapy thanis possible now. m-s- =-uaiiww 

[0006] It has been a long-standing objective to exploit retrovirus technology in human gene ttierapy applications. How- 
ever, the infection spectrum of retroviruses limits the applications of ttiese viruses in such applications. All known env 
variants have a rather broad infection spectrum in common. Here lies one of the major shortcomings of current recom- 
binant refrovirus technology. For the purpose of gene tiierapy. reti-oviruses are very useful vehicles for the transfer of 
therapeutic sequences, if proper ligand-receptor targets were available. In conclusion, the concept of the use of retro- 
viruses in human gene therapy is well documented (Gordon and Anderson, 1994; Havenga et al 1997- Vile et al 
1996). However, it would be clearly advantageous and desirable to devise a sti-ategy for targeted delivery of r&r&A- 
ruses, and modification of the infection spech-um. 

[0007] The invention provides a virus-like particle or gene delivery vehicle provided with a ligand capable of binding 
0 a human ammo add ti-ansporter. The invention provides said particle or vehicles wherein said ligand comprises pet^ 
tide molecutes or fragments thereof binding said transporter, for exanple to hCATI . The peptides or fragments thereof 
can bind to lor example the third extracellular domain of ttie cationic amino acid transporter hCATi or can bind to cells 
aqpressing this domain of hCATI protein on their extracellular cell surface. These hCATI binding molecules can be pep- 
tides or antibody fragments displayed on a filamentous phage or as free molecules. In a preferred embodiment ttie 
invention provides a virus-like particle or gene delivery vehicle for delivery of genes to human cells, however, it is also 
iiSSiurtL*ton°^^ ^ ^^^^ °' ''^'"^ <»nPO"nds. such as toxins for selective Killing or antigens for 

10008] In aparticular embodiment of ttie invention, a virus-like particle or gene delivery vehicle is provWed comprisina 
at least one viral protein provided with said ligand. Included in the present invention is the use of hCATI binding ligands 
to provide a particle or vehicle ttiat employs hCATl to enter a hemopoietic stem cell or any other cell expressing hCATI 
hCATi ligands can be incorporated in ttie envelope of a retrovirus or the capsid of any other viral or non-viral gene 
transfer vehicle such as an adenoviral vector. Incorporation of ttiese hCATI binding sequences can be done using tech- 
niques Known in ttie art. "yiowi 
[0009] The invention provides a virus-like particle or gene delivery vehicle wherein said viral protein comprises an 
envelope protein. In a preferred embodiment ttie invention provides a a mutant retroviral envelope ttiat is derived from 
a wild-type ecotropic envelope and which employs hCATI to enter the human or primate cell by binding to hCATI Such 
a new retroviral envelope molecule, when incoiporated in a retroviral virion, will be able to infect hCATI positive cells 
such as human PHSCS at high efficiencies. The mutant retroinrai envetopes can be used to pseudotype recombinant 
type C retiwirus including but not limited to murine leukemia reteoviral vectors. In a furttier embodiment of ttie present 
invention ttiese hCATI binding envelopes can also be used to pseudotype lentiviral vectors including equine or HIV 
derived lentivnal vectors (Kim et al., 1 998; Rizvi and Panganiban, 1 992), (Kafri et al.. 1 997; PoeschIa et al 1996) (Miy- 
oshi et al.. 1997; NaWini et al., 1996b). Any hCATI ligannds or binding envelope molecules or parts ttiereof made 
according to ttie methods described herein or ottier mettiods can be ligated into full lengtti mammalian retroviral enve- 



30 



35 



40 



45 



SO 



BS 



2 



EP0 959136A1 

lope expression constructs and introduced in cell lines expressing and containing all the sequences necessary for the 
generation of infectious and functional retroviral particles, in a preferred embodiment the invention provides a virus-like 
particle or gene delivery vehicle derived from a retrovirus. 

[001 0] In yet another embodiment, the Invention provides a virus-like particle or gene delivery vehicle wherein said 
5 viral protein comprises a capsid protein. hCATI binding sequences or ligand can also be incorporated in the capsid pro- 
teins of adenovirus including but not limited to the HI loop of the knob domain of an adenovirus (Krasnykh et al., 1 998) 
preferably an adenovirus which does not bind to the adenoviral receptor CAR1 or MHC1 . This results in an adenovirus 
that enters cells through hCAT1 . Deduced from mCAT1 absent expression in mouse liver (Gloss et al.. 1993) an hCATI 
binding adenovirus does not exhibit liver transduction when administered in wvo. By combining an hCATI targeted 
10 knob with a ligand for another in vivo target hCATI , targeting of an adenovirus can remove an important limitation of in 
vivo use of adenoviral vectors for gene therapy (Sullivan et al., 1997). In another embodiment an hCATI targeted ade- 
novirus will more efficiently transduce cells that are difficult to transduce such as endothelial cells or smooth muscle 
cells as compared to a wildtype adenoviral vector including but not limited to an adenoviral vector derived from the ade- 
noviral serotype 5, the invention provides a virus-like particle or gene delivery vehicle derived from an adenovirus. 
15 [001 1 ] An hCATl targeted adenovirus is useful for local applications of adenoviral vector such as in patients with res- 
tenosis following balloon angioplasty where smooth muscle cells need to be transduced with for example an adenoviral 
vector carrying the ceNOS cDNA. More efficient transductton of these tissues results in lower multiplicity's of infections 
(MOIs) that can be used and therefore less vector associated toxicity to the tissues surrounding the target cells 
(PCT/EP98/00723). 

20 [001 2] In a preferred embodiment, the invention provides a virus-like particle or gene delivery vehicle according to the 
invention wherein said amino acid transporter is a cationic amino acid transporter, preferably a human cationfe amino 
acid transporter-1 (hCAT1) . In a prefen-ed example of the invention provided in the experimental part the InventkMi pro- 
vides a virus-like particle or gene delivery vehicle wherein said ligand comprises an antino acid sequence selected from 
Table 2, preferably from tiie last four different sequences of Table 2 or a sequence functionally related thereto. Various 

25 examples of a ligand having hCATI binding activity are provided, a particularly strong example is a ligand comprising 
at least a part of. comprising minimally 5. more preferalily minimally 7 amino acids of the amino acid sequence SVS- 
VGMKPSPRP. 

[0013] In yet another emtxxilment, the Invention provides a virus-like particle or gene delivery vehicle according to 
the invention wherein wherein said ligand comprises a fragment derived from a phage displaying at least one antibody 
30 fragment selected for its capacity to bind with said an^no acid transporter, in particular a virus-like particle or gene deliv- 
ery vehicle is provided wherein said antibody fragment comprises an amino acid sequence as shown in Figure 16 or a 
an amino acid sequence functionally equivalent thereto or obtainable by a metiiod as described in the experimental part 
of this description. 

[0014] The invention also provides use of a virus-like particle or gene delivery vehicle according to the invention in 

35 gene-transfer tiierapy In numerous gene therapy appltoatfons targeted delivery of genes into defined cells is provided 
by the invention, most notably in the case of in vitro gene transfer into cell types present with low abundance in cell mix- 
tures and in approaches for in vivo gene transfer into cells in a living animal body In a particular embodiment the par- 
ticles or vehicles provided by the invention are used for gene tiierapy using hCAT1 mediated gene transfer including but 
not limited to mammalian smootfi muscle cells or hemopoietic stem cells such as CD34+CD38- or 

40 CD34+(CD33CD38CD71)- cells, including but not limited to adenoviral or retroviral gene transfer vehicles. 

[001 5] The Invention also provkles a metiiod for selecting a filamentous phage expressing a protein capable off bind- 
ing to a ligand comprising constructing a phage library, enriching said library for phages having desired binding charac- 
teristics by at least one round of selection of phages for tiieir capacity to bind to a synthetic peptide derived from said 
ligand, further comprising enriching saki library for phages having desired binding characteristics by at least one round 

45 of selection of phages for tiieir capacity to bind to a cell expressing said ligand. 

[001 6] The invention for example provides a peptide phage display to select hCATi binding peptides for incorporation 
in a ligand. To isolate peptides that bind to the third extracellular domain of we employed peptkJe phage display. A 12 
mer peptide phage display library was purchased from New England Biolabs. This Itorary is constructed in the filamen- 
tous E. cdi phage ml 3 and the peptide sequences are displayed as N-terminal fusion proteins with the minor coat pro- 

50 tein pllL The unamplified library had a complexity of 1 .9 x 10^ different sequences as determined by the suppliers. We 
amplified the library once before using it to select hCATI binding peptide phages. Two targets were used to select for 
peptide displaying phages which bind to tiie ttiird extracellular domain of hCATI. Rrst the predicted third extracellular 
domain of hCATI was synthesised as a synthetic peptkie by Neosystem. Strassbourg, France. The N-termlnus off this 
peptide was Wotinylated and followed ty three amino acid linker reskJues KRR. followed by the predicted sequence off 

55 the third extracellular domain. Secondly we generated cell lines derived from the human 91 1 cell line that overexpress 
hCATI as judged by steady state mRNA expression levels. The hCATI expression const-uct hATRCCi which is a 
pcDNA3 based expression consfruct of tiie hCATI cDNA was employed to transfect 91 1 cell lines followed by selection 
for neomycine resistance. A cloned cell line designated k08 was isolated which expresses high levels of hATRCCi 
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derived hCATI mRNA, 

[001 7] The invention is furtlier described in tlie experimental part of this desaiption which is not limiting the invention 
thereto. 

Experimental part 

[0018] Retroviruses are RNA viruses which effidently integrate their genetic information into the genomic DNA of 
infected cells via a reverse-transcribed DNA intermediate. This property of their life-cycle and the fact that parts of their 
genetic material can be replaced by foreign DNA sequences make retroviruses one of the most promising vectors for 
the delivery of genes in human gene therapy procedures, most notably for gene therapies which rely on gene transfer 
into dividing tissues. Most retroviral vector systems are based on mouse retroviruses and consist of two components, 
i.e. (i) the recombinant retroviral vector carrying the foreign sequences of interest, and (ii) so-called packaging cells 
expressing the structural viral proteins of which the encoding sequences are lacking in the retroviral vector. Expression 
of (i) in 00 results in the production of recombinant retroviral particles capable of transducing susceptible target cella 
[001 9] The infectivity and host cell range of the retrovirus particle is conferred by an envelope glycoprotein which spe- 
cifically binds to a receptor molecule on the target cell membrane. The envelope glycoprotein of all known retroviruses 
consists of two associated peptides, which are derived by proteolytic cleavage from the same precursor protein 
encoded by the retroviral envelope (env) gene (Gunzburg and Salmons, 1996; Weiss, 1996) . The amino terminal 
domain enconpasses specific binding site(s) for its receptor on the target cell membrane, determining the virus host 
range. Within this domain of about 200 amino acids highly conserved sequences are present that are interrupted by two 
segments designated VRA and VRB which vary in sequence and length among various mammalian type C retroviruses 
(Battini et al., 1992). The carboxy terminal peptide, which contains trans-menUarane anchor sequences, is assumed to 
account for the selective uptake of the envelope glycoprotein in the virus particle and to mediate fusion between the 
virus membrane and - depending on the type of virus - the plasma membrane or intracellular vesicle membrane of the 
target cell (JanuszesW et al.. 1997; Thomas et al., 1 997). In figure 1 a schematic representation of the structure of MuLV 
env protein is given. Several envelope glycoprotein variants with different infection spectra for mammalian cells have 
been identified (Battini etal., 1992). 

[0020] There are examples or recombinant viruses carrying an amphotropte or GaLV envelopa Recombinant viruses 
can-ying an amphotropic or GaLV envelope are capable of infecting human and murine cells and are commonly used in 
gene transfer trials including human gene therapy involving the pluripotent hemopoietic stem cell (PHSC) (Havenga et 
al.. 1997). Gene transfer frequencies into PHSCs of human patients and non human primate animal models have been 
shown to be extremely low and limit therapeutic stem cell gene therapy (Havenga et al., 1997; Hoogert)rugge et al.. 
1996; Van Beusechem et al.. 1993; van Beusechem et al., 1992). 

[0021] One important limiting factor has been shown to be low expression levels of retroviral receptors such as the 
one mediating entry of amphotropic MuLV retrovirus (GLVR2) (Orlic et al., 1996; van Es et al., 1996). The quiescent 
state of PHSCs when isolated for ex vivo gene transfer procedures poses another blockade (Knaan-Shanzer et al., 
1996). Murine stem cell gene therapy experiments have traditionally been performed with ecotropic MuLV vectors 
(Havenga et al.. 1997). Recombinant viruses carrying an ecotropic envelope are only capable of infecting murine cells. 
Transfer of genes into murine PHSCs using ecotropic retroviral vectors has been shown to result in high transduction 
efficiencies in circulating PHSC derived peripheral Wood cells (PBL). The transduction efficiencies are high enough to 
be therapeutic If achieved In human PHSCs reaching levels of PHSC gene transfer varying between 30-80 %. 
[0022] A small number of studies have been performed In which the transduction efficiency into murine PHSCs of eco- 
tropic and amphotropic retroviruses were actually compared directly (Havenga et al., 1997). One of these studies indi- 
cated that infection with amphotropk; retrovirus resulted in expression and thus transgene presence for less than 8 
weeks whereas infection with ecotropic virus resulted in expression for more than 44 weeks after transplantation 
(Demarquoy, 1 993). In a similar study, ecotropic virus was shown to be approximately 50 fold more efficient in transduc- 
ing murine PHSCs as conpared to amphotropic retrovirus (Orlic et al., 1 996). 

[0023] Ecotropic and amphotropic retrovirus differ in the receptor that is employed for virus entry (Albritton et al., 
1989; van ZeijI et al., 1994). Ecotropic virus binds target cells via the ecotropic receptor mCATI which is a transporter 
of cationic L-amino acids (Kim et al., 1991) and amphotropic retrovirus binds target cells via the amphotropic receptor 
GLVR2. a sodium dependent phosphate transporter GLVR2 (Kavanaugh et al., 1994; Miller and Miller. 1994; van Zeiil 
etal., 1994). 

[0024] A comparative study measuring mRNA levels of both the ecotropic and amphotropic receptors in mouse 
PHSCs (lit o-Wtbright) revealed an important difference. This study demonstrated that ecotropic receptor (mCATi) 
mRNA levels in these cells are high whereas amphotropic receptor (GLVR2) mRNA levels were undetectable by RT- 
PCR (Orlic et al., 1996). GLVR2 expression studies on CD34+(CD38,CD33.CD71) (CD34^lin cells) isolated from 
human bone marrow, umbilical cord blood and immobilised peripheral blood supports these data (van Es et al.. 1996). 
[0025] Another important factor which plays a role in determining successful retroviral entry and integration is the 
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postbinding route of entry o1 a retrovirus particle. The postbinding entry route for ecotropic virus is different from that of 
amphotropic retrovirus. Ecotropic retrovirus tranductions are sensitive to lysosomotropic agents such as chloroquine 
and NH4C(. This suggests that upon binding of the ecotropic retrovirus, the retrovirus is internalised by receptor medi- 
ated endocytosis (McClure et al., 1990). In contrast upon binding of the envelope of amphotropic retrovirus the viral 
5 envelope directly fuses with the plasma membrane. This is a process that is not disrupted by lysosomotropic agents 
suggesting that the postbinding steps of amphotropic MuLV virus are essentially different from those of ecotropic MuLV 
retrovirus (McClure et al., 1990). 

[0026] The human homologue of the murine ecotropic virus receptor mCATI is hCATI . Like mCATI mRNA expres- 
sion in mouse PIHSCs. hCATI mRNA is expressed at high levels in human PHSCs (Orllc et al.. 1996). For both mCATI 
10 and hCAT1 the normal function Is the import of cationic amino acids such as lysine and arginine (Albrltton et al., 1993; 
Malholra et al., 1996). The third predicted extracellular domain of mCATI includes a sequence YGE. The residues are 
crucial for receptor function. In the nonfunctional hCATI the sequence of the third extracellular domain Is PGV. Mutation 
of the human sequence into one or two of the residues of mCATI results in a hCAT1 protein with ecotropic receptor 
function (Albritton et al., 1 993; Yoshimoto et al., 1 993). See also figure 2. 
75 [0027] A number of mutant ecotropic envelope molecules have been described in the literature. MacKrell et al have 
nnutated amino acids within the receptor-binding domain VRA of ecotropic MuLV envelope in order to identify residues 
involved In receptor binding. Virions incorporating mutant envelopes carrying mutations at amino acid residue D84 have 
lost their binding capabilities to the ecotropic receptor mCATI (MacKrell et al., 1996). Virions carrying D84 mutated 
envelope protein were tested on human cells to search for a possible change in receptor recognition specificity but were 
20 found not to infect human cells (Mike Januszeski. personal communication). 

Skov and Andersen have studied ecotropic Moloney envelope interactions with mCATI by generation of mutant enve- 
lope molecules with mutated arginine and lysine residues in gp70 including VRA followed by introduction in a replication 
competent retroviral backbone (Skov and Andersen. 1993). Mutations R135G, K137Q. R157G and R159A 
(R102G,K104Q.R124G and R126A without signal peptide respectively) resulted in virions that were not able to repli- 
es cate. 

Kingsman et al have described in PCT application WO96/31602 an insertion site in the VRA domain of ecotropic enve- 
lope which allows modification of the tropism. An integrin binding sequence was inserted resulting in infection of human 
cells expressing the respective integrin. 

[0028] PVC-21 1 murine leukemia virus (MuLV) is a neuropathogenic variant of ecotropic Friend MuLV (F-MuLV) that 
30 causes a rapidly progressive neurodegenerative disease in susceptible rodents. PVC-21 1 MuLV, but not the parental F- 
MuLV, can infect rat brain capillary endothelial cells (BCEC) efficiently, and the major determinant for BCEC tropism of 
PVC-21 1 MuLV is localized within the env gene. More specific analysis indicated that El 16G and E129K substitutions 
in the background of the F-MuLV envelope protein were sufficient for confen-ing BCEC tropism on the virus (Masuda et 
al.. 1996a). Host range changes of these mutations were found to include CHO cells normally not infectable with eco- 
35 tropic F-MuLV or M-MuLV. The latter suggests that these mutations overcome a negative effect of CAT1 CHO cell recep- 
tor glycosylation in the region of virus binding in the third extracellular domain of mCATI (Masuda et at.. 1996b). 
[0029] By employing particular natural env variants the transduction spectrum can be limited to some extend, but true 
specificity for human target cells of interest can not be obtained following this strategy (Masuda et al.. 1996a; von Kalle 
et al.. 1994; Wilson et al., 1994). 
40 [0030] In the present invention we describe the expansion of the host range of an ecotropic retrovirus or other gene 
transfer vehicle such as an adenoviral vector resulting in increased transduction of hemopoietic stem cells. In this inven- 
tion, targeted delivery is accomplished by directing the retrovirus particle to cell membrane molecules differing from the 
natural receptor. This could then lead to increased specificity of transduction. 

[0031 ] The present invention discloses examples of nrK>lecules that bind to hCATI and that can be used to develop 
45 gene transfer vehicles such as retroviral and adenoviral vectors. In particular, the invention relates to proteins and deriv- 
atives thereof expressed in the lipid bilayer of enveloped virus particles such as retroviruses. Methods, materials, pro- 
cedures and pharmaceutical formulations for the design and preparation of the above molecules and virus particles are 
also part of the inventbn. These molecules and virus particles have applteations In the field of virology, gene tiierapy. 
biochemistry and molecular biology. 
so [0032] The present invention relates to peptide molecules binding to hCATI . These molecules are characterized by 
their ability to bind the third extracellular domain of tiie cationic amino acid transporter hCATI eitiier a synthetic peptide 
encompassing this third extracellular domain or by binding to cells expressing this domain of hCAT1 protein on their 
exfacellular celt surface. These hCATI binding molecules can be peptides or antibody fragments displayed on a fila- 
mentous phage or as free molecules. 
55 [0033] Included in the present invention are filamentous phages displaying hCATl binding TTK>lecul6S and tinat can be 
used to transfer gen s into cells by modification of the phage genome using techniques known in the art. 
[0034] Included in the present invention is the use of hCATI binding molecules to design vectors that employ hCATI 
to enter a HSC or any other cell expressing hCATI . hCATI binding molecules can be incorporated in tiie envelope of a 
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retrovirus or the capsid of any other viral or non-viral gene transfer vehicle such as an adenoviral vector. Incorporation 
of these hCAT1 binding sequences can be done using techniques known in the art. 

[0035] Preferred are mutant retroviral envelopes that are derived from wild-type ecotropic envelope and which employ 
hCAT1 to enter the human or primate cell by binding to hCATI. These new retroviral envelope molecules, when incor- 
porated in a retroviral virion, will be able to infect hCATI positive cells such as human PHSCs at high efficiencies. The 
mutant retroviral envelopes can be used topseudotype recombinant type C retrovirus including but not limited to murine 
leukemia retroviral vectors. In a further embodiment of the present invention these hCATI binding envelopes can also 
be used to pseudotype lentiviral vectors including equine or HIV derived lentiviral vectors (Kim et al.. 1998; Rizvi and 
Panganiban. 1992), (Kafri etal.. 1997; Poeschia etal.. 1996). (Miyoshi etal.. 1997; NaJdini etal.. 1996b). 
[0036] Any hCATI binding envelope molecules or parts thereof made according to the methods described herein or 
other methods can be ligated Into full length mammalian retroviral envelope expression constructs and introduced In 
cell lines expressing and containing all the sequences necessary for the generation of infectious and functional retrovi- 
ral particles including but not limited to cell lines preferably derived from the adenoviral El transformed, human cell line 
PER.C6 (WO97/00326) and that express murine leukemia gag-pol constructs and a retroviral vector containing long 
terminal repeats (LTRs). and retroviral RNA packaging signals such as those vectors described In W096/35798. The 
hCATI binding envelopes made according to the sul)ject material of this Invention can also be used to pseudotype vec- 
tas other than murine leukemia retroviral vectors including but not limited to lentiviral vectors (Naldini et al 1996a- 
NaMini etal.. 1996b). 

[0037] In a further embodiment of the present invention, hCATI binding sequences can also be incorporated in the 
capsid proteins of adenovirus including but not limited to the HI loop of the knob domain of an adenovirus (Krasnykh et 
al., 1998) preferably an adenovirus which does not bind to the adenoviral receptor CAR1 or MHC1. This results in an 
adenovirus that enters cells through hCATl. Deduced from mCATI absent expression in mouse liver (Gloss et al.. 
1993) an hCATI binding adenovirus does not exhibit liver transduction when administered in vivo. By combining an 
hCATI targeted knob with a ligand for another in vivo target hCATI targeting of an adenovirus can remove an important 
limitation of in vivo use of adenoviral vectors fbr gene therapy (Sullivan et at. 1997). In another embodiment an hCATI 
targeted adenovirus will more efficiently transduce cells that are difficult to transduce such as endothelial cells or 
smooth muscle cells as compared to a wildtype adenoviral vector including but not limited to an adenoviral vector 
derived from the adenoviral serotype 5. 

[0038] An hCATI targeted adenovirus is useful for local applications of adenoviral vector such as in patients with res- 
tenosis foltowing balloon angioplasty where smooth musde cells need to be transduced with for example an adenoviral 
vector carrying the ceNOS cDNA. More efficient transduction of these tissues results in lower multiplicity's of infections 
(MOIs) that can be used and therefore less vector associated toxicity to the tissues surrounding the taraet cells 
(PCT/EP98/00723). ^ 
[0039] In another aspect of the present invention, the hCATI binding human FAbs that are part of the subject matter 
of this invention can be used to measure expression of hCATI molecules on cells that are targets for gene therapy using 
hCATI mediated gene transfer including but not limited to mammalian hemopoetic stem cells such as CD34+CD38- or 
CD34+(CD33CD38CD71)- cells. This could be part of a procedure aimed at determining when or whether a patients 
cells are most susceptible to gene transfer through hCATI including but not limited to adenoviral or retroviral gene trans- 
fer vehicles. 

[0040] The skilled artisan will be able to apply the teaching of the present invention to other virus capsid or envek)pe 
or non-viral gene transfer molecules or vehicles than those exemplified herein without departing from the present inven- 
tion and therefore the examples presented are illustrations and not limrtations. It is intended that all such other examples 
be included within the scope of the appended claims. 

Example 1 . Sequences of hCATI cDN As amplified from human CD34+ cells. 

[0041 ] For the purpose of developing gene transfer tools that enter PHSCs through hCATI , specifically through bind- 
ing to the third extracellular domain, we isolated total RNA from a number of different human CD34+ samples and deter- 
mined the cDNA sequence of hCATI (see figure 2). Total RNA was isolated accoiding to the protocol described by 
ChomczynsW et al (Chomczynski and Sacchi. 1987), RT-PCR was performed by using the Superscript Preamplrfication 
System for First Strand cDNA Synthesis (Life Technologies). For first strand synthesis random hexamers were used. 
The hCATI cDNA was amplified with two sets of primers, each resulting in a product of approximately 1 kb encompass^ 
ing the open reading frame of the hCATI mRNA (Yoshimoto et al.. 1991). DNA sequencing was performed by Base- 
Clear. Leiden. The Netherlands using automated sequence analysis. In figure 3a and 3b the residts of sequence 
analysis of hCATI cDNA isolated from CD34+ cells from mobilized peripheral blood or umbilical cord blood are com- 
piled. Clearly firom thes nucleotide sequence analyses (figure 3b ) it can be deduced that indeed in the CD34+ samples 
tested hCATI is expressed and includes the third extracellular domain with predicted sequence KNWQLTEEDFGNTS- 
QRLCLNNDTKEGKPQVGQF which includes the sequence PGV determining function as receptor (see above) . There- 
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fore targeting through this domain or part of this domain of hCATI in hemopoietic CD34* cells including but not limited 
to hemopoietic stem cells such as defined by lineage negative phenotypes e.g. CD34+CD38 is possible. 

Example 2. Peptide phage display to select hCATI binding peptides. 

5 

[0042] To isolate peptides that bind to the third extracellular domain of hCAT1 (Albritton et al.. 1993) (figure 2) we 
employed peptide phage display. A 12 mer peptide phage display library was purchased from New England Biolabs. 
This library is constructed in the filamentous E. coli phage m13 and the peptide sequences are displayed as N-terminal 
fusion proteins with the mina coat protein pill. The unamplif ied library had a complexity of 1 .9 x 1 0^ different sequences 

10 as determined by the suppliers. We amplified the library once before using It to select hCATI binding peptide phages. 
Two targets were used to select for peptide displaying phages which bind to tfie thiixJ extracellular domain of hCATI . 
Rrst the predicted third extracellular domain of hCATI was synthesised as a synthetic peptide by Neosystem, Strass- 
bourg, France, The N-terminus of this peptide was biotinylated and followed by three amino acid linker residues KRR, 
followed by the predicted sequence of the third extracellular domain (figure 4). Secondly we generated cell lines derived 

15 from the human 91 1 cell line that overexpress hCATI as judged by steady state mRNA expression levels. The hCATI 
expression construct hATRCCI which Is a pcDNAS based expression consfruct of the hCATI cDNA (Malhotra et at., 
1996)was enployed to transfect 911 cell lines followed by selection for neomycine resistance in 1 mg/ml of G418 
(Genetecin, Ufe Technologies, Inc). A cloned cell line designated k08 was isolated which expresses high levels of 
hATRCCI derived hCATI mRNA (figure 5). 

20 [0043] To select for peptide displaying phages that bind to the putative tiilrd extracellular domain of hCATI as 
expressed on human cells the following selection procedure was employed. Six rounds of selection on biotinylated 
hCATI peptide (figure 4) followed by three rounds of selection on hCATI overexpressing cells W)8. Initially two separate 
selections were canied out differing In the stringency of washing. Low stringency washing consisted of 3 washes with 
2 % (w/V) milk powder in PBS with 0.05 % (vh) Tween 20 and 3 washes with PBS. High stringency washing consisted 

25 of 5 washes with 2 % (w/v) milk powder, PBS with 0.05 % Tween 20. 5 washes with PBS, 0.05 % Tween 20 and 5 
washes with PBS. After 1 round of selection on 911-hCAT1-k08 cells eluted phages from both washing procedures 
were pooled and used for a second and third round of selection on 91 1 -hCAT1-k08 cells. The results of these experi- 
ments are depicted in table 1. Clearly the ratio of input over output increases upon selection on hCATI peptide indica- 
tive of selection for binding phages. When selection on hCATI positive cells was started tfie ratio drops and slightiy 

30 increases in tiie last round on hCATI expressing human cells. 

[0044] After the last round of selection witti the hCATI peptide and after each round of cell selection tfie pools of pep- 
tide displaying phages were tested for binding to immobilized hCATI peptide using an Enzyme Unked Immunosorbent 
Assay (ELISA). 96-well plates were coated with 2 mg/m! biotinylated BSA in PBS and incubated for 1 hour 37 '^C after 
which the wells were rinsed 3x for 5 minutes with PBS/0.05% Tween 20. Then the wells were saturated with streptavidin 

35 (10 mg/ml in PBS/0.5% gelatin) for 1 hour at room temperature (RT) and washed 3 times wfth PBS/0.05 % Tween 20. 
Then the wells were incubated overnight at 4 »C with biotinylated hCAT1 peptide (figure 4) at a concentration of 10 
mg/ml in PBS. The next day the wells were rinsed two times with PBS/0. 1 % Tween 20 and 2x witti PBS. Then ttie wells 
were bk)cked witti 2% non-fat milkpowder in PBS for at least 30 minutes at RT followed by three rinses with witii PBS/0. 
1 % Tween and three with PBS. Subsequentiy an equal volume of 4% non-fat milkpowder in PBS was added to all wells 

40 and culture supernatant or purified phage (PEG precipitated) and incubated for 1 .5 hours at RT After ttiis incubation 
the wells were washed three times with PBS/0.1 % Tween 20 and ttiree times with PBS followed by incubation witti an 
anti-mi 3 antibody solution (Phamiacia. 1 :5000 In 2% non-fat n^Ikpowder In PBS) for 1 hour at RT Again tine wells were 
washed ttiree times witti PBS/0.1 % Tween and ttiree times witti PBS followed by ttie addition of a rabbit-anti goat HRP 
conjugate solution (BloRad. 1:2000 in 2% non-fat milkpowder In PBS) for 1 hour at RT After this incubation the wells 

45 were washed again three times with PBS/0. 1% Tween and ttiree times witti PBS. Detection of phage binding was then 
visualized using TMB colour solution (0. 1 mg/ml TMB, 1 % DMSO. 1 x TMB buffer, 0.001 % 30% H2G2 in H2O) 20^0 min 
in the dark at RT and stopped witii 2 N H2SO4 and read at 450 nm in a microplate reader. Using ttils hCATI specific 
ELISA an enrichment of phages binding to hCATI peptide is achieved (figure 6). Importantly after binding of the peptide 
selected pools to hCATI overecpressing cells eluted phages still bind to hCATI peptide Clones isolated from round 3 

50 on hCATI overexpressing cells were isolated and tested on hCATI peptide ELISA (figure 7). Except 1 , all tested ctones 
bound to hCATI peptide and tiius to ttie third extracellular domain displayed on human cells. 
[0045] To confirm enrichment for specific sequences and to determine tiie amino acki sequence of ttie 12 mer pep- 
tides displayed, we isolated single stranded ml 3 phage DNA for automated sequence analysis (Baseclear, Leiden, The 
Netherlands). The oligonucleotide used for sequencing was 5'-CCCTCATAGTTAGCQTAACQ-3\ We sequenced clones 

Bs Isolated from ttie pools of various peptide and cell selections. For ttiis purpose we pooled ttie eluates of ttie two different 
washing conditions. In addition to the amplified 12 mer peptide library w only selected clones from peptide rounds 3 
.5 and 8 and cell rounds 1 ,2 and 3. In table 2 ttie sequences determined for ttie various clones are given. Clearly a very 
strong selectton occun-ed because all cell selected phage clones displayed one sequence namely: SVSVGMKPSPRR 
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This sequence is also displayed by phages in hCATI peptide selected pool 6 in a mixture with 3 other sequences. 
These other phages are lost once the phage pools selected on hCAT1 peptide are selected for binding to hCAT1 over- 
expressing cells. 

[0046] The cloned SVSVGMKPSP RP displaying ml 3 phage was used in experiments to measure binding of the dis- 
s played sequence to cells that express hCATI . First we did an experiment using the flow cytometer and the 2 cell lines 
91 1 i3cDNA3 and 911 -hCATI (k08) . Cells were incubated with 10^^ phage inl 00 ml PBS/0. 1% BSA for 1 hour at room 
temperature. Subsequently the cells were washed twice with PBS/0.1 % BSA followed by incubation of the cells with 
anti-mi 3 antibody (Pharmacia. 1:500 in PBS/0.1 % BSA) for 30 min at room temperature and washed twice with 
PBS/0.1 % BSA. Then the cells were incubated with rabbit-anti goat FITC (DAKO, 1 :50 in PBS/0.1 % BSA) for 30 min 
10 at RT and washed twice with PBS/0. 1 % BSA. Binding of phage was then measured in the FL1 channel of a Becton and 
Dickinson flow cytometer. As a control we used an identical amount of phage from the amplified 12 mer tibary. In figure 
8 the results of this experiment are depicted. Clone #26 phage binds to 91 1 cells and in particular to 91 1 cells that over- 
express hCATI. 

[0047] We also measured cell binding of phage incubating hCATI expressing cells with phage followed by trtering total 
75 ceil bound phage, eluted phage and cell associated phage fractions on E.coli using a standard ml 3 plating assay on a 
lawn of E.coli cells. For this purpose E.coli strain ER2537 is grown overnight in LB medium. This overnight culture is 
then used to Inoculate 20 ml of fresh LB medium at an ODgoonm of 0.05. Once at an ODeoonm of 0.5 500 ml of the 
ER2537 E.coli bacteria were mixed with 500 ml dilutions of phage samples and incubated at RT for 10 min. Plating on 
a standard LB-agar plate was performed by mixing 3 ml top agar with 200 ml of each sample. Once the topagar was 
20 solidified the plates were transferred upside down to a 37 *'C incubator for 1 2-1 4 hours. Plaques were counted and used 
to determine the number of phage particles binding to hCATI expressing cells. In table 3 cell binding is depicted. 
[0048] Clearly from these results we can conclude that the 1 2 mer peptide displaying phage with sequence SVSVQ- 
MKPSPRP indeed binds to hCATI expressing cells. hCATI expressing Cells were also incubated at 37 *»C followed by 
elution of bound phages plus cell associated phages were liberated. Both were used in phage litering on E.coli and 
2S Clearly a cell associated fraction is detected. This suggests that the phage displaying sequence SVSVGMKPSPRP and 
which bind to hCATI also enter a human hCATI expressing cell. This feature of sequence SVSVGMKPSP RP could be 
used for the development of gene transfer products useful in gene therapy. 

Exanple 3. Human FAb phage display to select hCATi binding human antibody molecules. 

30 

[0049] To isolate antibodies that bind to the third extracellular domain of hCATI (Albritton et al., 1993) (figure 2.4) we 
employed phages displaying human FAb fragments encompassing the light arKJ heavy variable and constant regions. 
A human FAb phage display library was constructed in phage display vector pCESi a vector derived from pCANTABS 
(McGuiness et al., 1996). The library was constructed in the filamentous E. coli phage ml 3 and the FAb sequences are 

35 displayed partly as N-terminal fusion proteins with the minor coat protein pill. TTie unamplified library had a complexity 
of approximately 3.3 x 10^^ different sequences. Two targets were used to select for peptide displaying phages which 
bind to the third extracellular domain of hCAT1 . First the predicted third extracellular domain of hCAT1 was synthesized 
as a synthetic peptide by Neosystem. Strassbourg. Franca The N-terminus of this peptide was biotinylated and fol- 
lowed by three annno add linker rescues KRR. followed by the predicted sequence of the third extracellular domain 

40 (figure 2,4). Secondly we generated cell lines derived from the human 91 1 cell line that overexpress hCATI as judged 
by steady state mRNA expression levels. The hCATI expression construct hATRCCI which is a pcDNA3 based expres* 
sion construct of the hCATI cDNA was employed to transfect 91 1 celt lines followed by selection for neomydne resist- 
ance in 1 mg/ml of G418 (Genetlcin, Life Technologies, Inc) . A cloned cell line designated k08 was isolated which 
expresses high levels of hATRCCI derived hCATI mRNA (figure 5). 

45 [0050] To select for FAb displaying phages that bind to the putative third extracellular domain of hC ATI as expressed 
on human cells the following selection procedure was enrptoyed. Four rounds of selection on biotinylated hC ATI pep- 
tide (figure 4) followed by three rounds of selection on hCATI overexpressing cells k08. For selection on biotinylated 
hCATI peptide 250 ml of FAb library (or eluted phage from the previous round) was mixed with 250 ml 4% h/larvel in 
PBS and equilibrated while rotating at RT for 1 hour. Subsequently biotinylated hCATi peptide (20-500 nM in H2O) was 

50 added. This mix was incubated on the rotator at RT for 1 hour before 250 ml equilibrated streptavidin-dynabeads in 2% 
Marvel in PBS was added. After incubation on a rotator at RT for 1 5 min tiie beads with tiie bound phage were washed 
5 times with PBS/2% Marvel/0.1% Tween. 5 times with PBS /0.1% Tween and 5 times with PBS. Then the phage were 
eluted by incubation with 0. 1 M Tri-ethyl-amine on a rotator at PT for 1 0 min and neutralised In 1 M Tris-HCI pH 7.4. The 
eluted phage were titered and amplified in TG1 before the next selection. For selection on 91 1-hCAT1 cells, tiie cells 

55 were harvested at a conf luency of about 80 % and suspended in PBS/1 0 % FBS/2 % Marvel to a final concentration of 
at least 3x10^ cells/ml. This ceil suspension was incubated for 30 min on a rowing boat mixer (or rotator), while at tiie 
same time phage were also preincubated in PBS/10 % FBS/2 % Marvel. Then the cells were centrifuged. resuspended 
in tiie preincubated phage solution and incubated on a rowing boat mixer (or rotator) for 1 hour. Aftenwards the cells 
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were washed 10 times with PBS/10 % FBS/2 % Marvel and twice with PBS. The cells were centrifuged and resus- 
pended In 0.6 ml water. Subsequently 0.6 ml 200 mM triethylamine was added (dropwise while vortexing). After 5 min- 
utes the suspension was neutralised by adding 0.6 ml of 1 M Tris-HCI pH 7.4 (dropwise white vortexing).After 
cenrifugation (5 min. 14000 rpm) the supernatant was transferred to a new tube and titered and amplified in TG1 before 
5 the next selection. The results of these experiments are depicted in table 4. clearly the ratio of input over output 
increases upon selection on hCATI peptide indicative of selection for hCATI peptide binding phages. When selection 
on hCATI positive cells was started the ratio dropped and slightly increased in the last round on hCATI expressing 
ceils. 

[0051] The pools of the last 3 rounds were tested for binding to the biotinylated hCATI peptide in a hCATI specific 

10 ELISA and also for cell binding by flow cytometric analysis (both protocols are described In example 2). After the last 
round of selection on cells the pool of FAb phages still binds to the biotinylated hCATI peptide (figure 9). Flow cytomet- 
ric analysis showed that this pool also binds to hCAT1 overexpressing cells (figure 10). From this pool 43 clones were 
analysed by fingerprint analysis and divided into 14 different groups. From each group 1 phage clone was tested for 
binding to the biotinylated hCAT1 peptide in a hCAT1 specific ELISA and also for cell binding by FACS analysis. Three 

15 clones appeared to be streptavidin binders whereas the other 1 1 clones showed binding to the biotinylated hOATI pep- 
tide (figure 11). Row cytometric analysts revealed that only 1 of tiie 14 clones showed strong binding to hCATI overex- 
pressing celts (figure 12). This clone was analyzed in more detail (figures 13,14). Clearly clone #25 binds strongly to 
the synthetic hCATI peptide used and to hCATI overexpressing 91 1 k08 cells. Moreover average fold increased binding 
of this phage to 91 1 -hCATI -k08 overexpressing cells over 91 1 -pcDNAS cells was found to be 1 .6 ± 1 .2 fold (figure 1 4). 

20 Double strand phagemid DNA was prepared and used to determine the nucleotide and deduced amino acid sequence 
of the displayed variable heavy and light chains. For a schematic picture of the vector pCES1 in which the library of var* 
table chains was cloned (see figure 15). The hCATI binding domains are as expected homologous to human immu- 
noglobulin sequences. The complementarity determining regions (CDRs) are indicated in figure 16. 
The sequences of this immunoglobulin can be incorporated in viral or non-viral proteins that mediate binding and entry 

25 to cells and thus create gene transfer vehicles that enter cells through hCATi . The hCATI binding human FAbs can also 
be used to measure expression of hCATI on cells that are targets for gene therapy using hCATI mediated gene trans- 
fer. 

Example 4 Incorporation of hCATl binding peptides in ecotropic retroviral envelope 

30 

[0052] To include hCATI binding peptides (see example 2) in the context of an ecotropic murine leukemia viral enve- 
lope we used functional display of ecotropic envelope by filamentous phages. We used the constmct gplll/env2 which 
encodes a fusion protein consisting of a prokaryotic signal peptide and all of the gp70 protein including the variable 
regions A and B and the polyprollne hinge (amino acid residues 34-308) fused to the capsid protein encoded by gene 

35 ill of ml 3. Numbering of amino acid sequences was done according to the unprocessed envelope sequence as depos- 
ited in the Swiss prot database with accession number P03385 and starting from the viral signal peptide. In table 5 all 
the oligonucleotides are depicted that are used for Insertion of the peptide sequences in retroviral envelope. 
[0053] Three sites and ways of peptide insertion have been chosen: (1) Insertion at the BstEll site of the ecotropic 
envelope. (2) Replacement of sequence PFSS (residues 96-99) by each of the 4 the peptides (see table 5). (3) 

40 Replacement of sequence LTSLTP (residues 1 22-1 27) by each of the 4 peptide sequences peptides (see table 5). The 
sequences PFSS and LTSLTP are predicted to be on displayed on the outside of the envelope protein as deduced from 
the structure of crystallised Friend ecotropic envelope (Fass et al., 1997). For the BstEll insertion constructs the two sin- 
gle stranded complementary oligonucleotides were synthesised. At the amino acid sequence level linker amino acids 
were included at the amino and carboxyterminus of the inserted peptide sequence. These single stranded oligonucle- 

45 otides were then mixed in equimolar fashion heated to 95 and slowly cooled to room temperature to allow hybridisa- 
tion of the complementary molecules to double stranded DNA. Annealing was followed by BstEll digestion and 
separation on a 2 % agarose gel run In TAE-buffer. DNA was then excised from the gel and purified using Qiaquick gel 
extraction kit (Qiagen.Qermany). At the same time double stranded phagemid DNA of construct gplll/env2 was 
digested and thus linearized with BstEll. Linearized gpll l/env2 DNA was subjected to an incubation with the thermosen- 

50 sitive alkaline phosphatase TSAP (Life technologies), then mixed in various molar ratios with double stranded BstEll 
digested oligonucleotides encoding any of the 4 hCAT1 binding peptides (see table 5b).71ien 1 unit of T4 ligase and T4 
llgase buffer supplemented wit 1 mM ATP as added. The ligation mixture was inculjated for 1 hour at + 20 *C. The liga- 
tion mixtures were tiien transformed into Max DH5a competent t)acteria (Life technologies). Ampiciliin resistant colo- 
nies were picked and subjected to a PGR with one of the 4 primers in table 5c and primer Ecoenvl 2 (see table 5). This 

55 PGR allows one to determine the nature of the inserted sequence and its orientation. Ptasmid DNA of colonies with cor- 
rect orientation of Insert DNA was tfien isolated using Qiagen columns and sequenced (Baseclear. Leiden) to confirm 
the complete sequence of the inserts and boundaries plus tiieir orientation. 

[0054] For tiie insertion/deletion of hGATI binding peptide sequences into gplll/env2 at the LTSLTP and PFSS posi- 
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tions two fragments were amplified (primary PGR) using Elongase polymerase and the following two pairs of primers: 
Fragment 1 : Eco nv17 (sense primer, table 5c) plus one of the even numbered oligonucleotides in table 5a. Fragment 
2: Ecoenv12 or ecoenvOS (antisense, table 5c) plus an odd numbered primer in table 5a. Fragment 1 harbours at the 
DNA lev I the 3' end whereas fragment 2 harbours the peptide insertion at the 5* end. Because both fragments have 

5 identical 3' (fragment 1) and 5* ends (fragment 2) they can be used to assemble a full double stranded DNA fragment 
encompassing the ecotropic envelope sequence between and including part of the ecoenvIT and ecoenv12 oligonu- 
cleotide sequences. This is done by first purifying the two fragments from the primary PGR using Qiaquick PGR purifi- 
cation columns to remove all remaining primers followed by doing a PGR using the two overlapping fragments, and 
primers ecoenv17 and ecoenv12, and all the components necessary for DNA amplHicatlon using Elongase. This step 

10 results In the assembly of a fragment harbouring the 1 2 mer hCATI binding peptide insertions and result In the deletion 
of the LTSLTP or PFSS sequence. These fragments are purified and digested with NotI and PinA1 resulting in a DNA 
fragment of approximately 519 basepairs which was isolated from an agarose gel using Qiagen DNA isolation kit. The 
519 basepair fragments were then ligated into a NotI and PinAI digested gplll/env2 Surfscript fragment of approxi- 
mately 4000 basepairs using T4 ligase as described atK>ve in example 3. E coli bacteria are then transformed with the 

15 ligation mixture, amplclllin colonies picked, plasmid DNA isolated and analyzed for the presence of 51 9 basepair inserts 
using Not1 and PinAI restriction enzymes and DNA agarose gel electrophoresis. Plasmids with appropriate inserts 
were then further verified by automated DNA sequencing of the inserts (Baseclear. Leiden). 
[0055] Phages displaying envelope with the various peptide inserts can then be produced and tested for their binding 
to and entry of hCATI expressing cells and compared to phages displaying the gpll l/env2 construct The hCATI binding 

20 envelopes can then be used to develop retroviral vectors produced by mammalian cell lines. 

Example 5 Soluble FAb generation and binding to human cells 

[0056] To prepare soluble FAb fragments of the hGATI binding FAb phage clone periplasmic fractions were made 
25 from HB2151 bacteria Infected with done # 25 phage. Infected bacteria were grown In LB medium with 2% glucose and 
100 mg/ml ampidllln (LBGA) overnight while shaking at 30 °G. The next day the Infected cells were diluted 1 :100 in 50 
ml fresh LBGA and grown at 37 °G until the OD500 was 0.8. Bacteria were then pelleted followed by resuspension in 25 
ml LB, 1 00 mg/ml ampidllln and 1 mM IPTG and Incubation for 3 hours at 30 ^G while shaking vigorously Then the bac- 
teria were pelleted and resuspended in 1 ml ice-cold PBS followed by a 14-16 hour Incubation at 4 *C while rotating. 
30 The next day the periplasmic fraction was deared from bacterial residues by centrifugatlon: once for 1 0 minutes at 8000 
rpm G (Eppendorf centrifuge #5402) at 4 «C followed by a spin of 10 minutes at 14000 rpm. 4 °G . Then the FAb peri- 
plasmic fractions were aliquoted and stored at -20 '^G. The presence and expression of FAb fragments was confirmed 
by doing a dot blot and protH'ng for human kappa light chains with anti-human kappa polydonai rabbit antibodies (Dako 
A0191 . 1 rIOOO dilution, 60 minutes) and anti rabbit IgG (H + L) antibodies conjugated with horse radish peroxidase (Bio- 
35 rad. 170-6515, 1 :20.000, 60 minutes). Each step was followed by washing 6 times with PBS. 0.05 % Tween 20 (v/v). 
Final detection of human FAbs was done using EGL staining (Amersham). This revealed the presence of high concen- 
trations of soluble FAb fragment of hCATI binding done # 25. Dilutions of antibodies were made in PBS, 0 5 % BSA 
(w/v) , 0.05 % Tween 20 (vAr). 

[0057] The FAb fractions made as described above were then used to perform flow cytometric analyses in 911- 
40 hGATI -k08 cells expressing hGATI and compared to phages displaying done 25 (see example 3) (figure 17). For this 
purpose cells were incubated with 100 ml periplasmic fraction of clone 25 or control done for 1 hour at room tempera- 
ture followed by washing twice with PBS, 0. 1 % BSA and incubation with 500 times diluted anti-human kappa light chain 
antibodies (see above) for 30 minutes at room temperature. This was followed by washing twice with PBS/0.1 % BSA 
and a 30 minute room temperature Incubation with goat-anti-rabbit immunoglobulin antibodies conjugated with phyco- 
45 erythrin (diluted In 1 :20 in PBS/0.1 % BSA, Sigma P9795) and measurement in a flow cytometer. Detection of phage 
binding was done as described under example 2. 

[0058] Oearly FAb preparations of clone 25 bind to hGATI expressing cells whereas Ffih fragments of an Irrelevant 
GHO cell binding clone did not The results are very similar to the results observed with phages displaying FAb ctone 
25 (figure 1 7). Gompared to phages displaying hCATI binding FAb 25, FAb fragments of done 25 fadlitate the meas- 
so urement of hCATI in a multiparameter setting such as CD34* or CD^*\\n cells. 
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Stringent selection 


Non-stringent selection 


Selection 


Input phages 


Output 
phages 


Output input 


Input phages 


Output 
phages 


Output input 


500 nanoM 
peptide 


1.4x10^° 


0.6x10^ 


4.3x10"® 


1.4x10^° 


1.5X10^ 


1.1 X 10"^ 


500 nanoM 
peptide 


3.8x10^ 


1.1 X 10^ 


2.9 xlO*"* 


3.8x10^ 


9.8 X 10^ 


2.6x10'^ 


500 nanoM 
peptide 
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3.9 xlO"^ 


3.8x10^ 


9.0x10^ 


2.4x10"* 


500 nanoM 
peptide 


2.1x10® 
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2.9x10-3 
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6.2x10'^ 


100 nanoM 
peptide 


5.3x10^° 


2.4x10® 


4.5x10-2 


6.7x10^° 


1.5x10^ 


2.2xl0'2 
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Table 1 (continued) 





Stringent selection 


Non-stringent selection 


Selection 


Input phages 


Output 
phages 


Output Input 


Input phages 


Output 
phages 


Output Input 


100 nanoM 
peptide 


1.2x10^' 


1.8x10^° 


1.5x10-^ 


1.0x10^^ 


5.0 X 10^^ 


5.0 xlO'*' 


hCAT1 cells 
(k08) 


9.8x10^^ 


1.1x10^ 


1.1x10'® 


1.1x10^2 


7.2x10® 


6.5x10'* 


hCATl cells 
(k08) 


ND 


2x10"^ 










hCATI cells 
(k08) 


2.6x10^*^ 


2.1 xlO^ 


8.1 X 10-® 









Table 2 



Round 


Target 


Insert sequence 


No of Identical clones 


Amplified library 


None 


EQSRPSWQLTPT 


1 






QTHQLLRKPPSF 


1 






YMHEPITPNPVT 


1 






WHHIPNSAKISL 


1 






SENLTLMTVLQM 


1 






NLMPPPVPRLPL 


1 






TPQGVHYHPNMR 


1 


1 


hCATI peptide 


ND 




2 


hCATI peptide 


ND 




3 


hCATI peptide 


TLNNHTTPPAWN 


1 






QWHSPFPTSRP 


1 


4 


hCATI peptide 


ND 




5 


hCATI peptide 


FEQHNWWDSHPQ 


1 






NTFDLWLQSVPQ 


7 


6 


hCATI peptide 


FEGCHPQSGLSC 


1 






FEQHNWWDSHPQ 


1 






NTFDLWLQSVPQ 


5 






SVSVGMKPSPRP 


4 


1 


HCATI cells 


SVSVGMKPSPRP 


4 


2 


hCATI cells 


SVSVGMKPSPRP 


4 


3 


hCATI cells 


SVSVGMKPSPRP 


23 
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Tables 



Binding and Intemalisation of phages displaying peptide SVSVGMKP- 

SPRP 


Celi-line: 


Temp: 


Phage rescue: 


#pfux1000: 


















clone #26 


12-mer library 












911-hCATI 


37C 


Elution 


120 


4.32 






Lysis 


72 


18.72 


911-hCATI 


37C 


Whole sample iysis 


205.2 


88.2 












911i3cDN/^ 


37C 


Elution 


55.68 


3.84 






Lysis 


47.52 


8.64 


911-PCDNA3 


37C 


Whole sample lysis 


216 


0 



Table 4 



Selection 


input phages 


output phages 


Output/input 
ratio 


500 nanoM peptide 


2.7x10^2 


9.0x10^ 


3.6 X 10'^ 


500 nanoM peptide 


5.7x10^2 
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3.3x10-^ 


100 nanoM peptide 
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1.6x10-3 
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hCATI cells (k08) 


5.4x10^2 


1.7x10^ 


3.1 X 10"^ 


hCATI cells (k08) 


5.4x10^2 


1.5x10^ 


2.8x10-^ 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: Introgene B.V. 

(B) STREET: Wassenaarseweg 72 

(C) CITY: Leiden 

(D) STATE: Zuid-Holland 

(E) COONTRr: the Netherlands 

(F) POSTAL CODE (ZIP) : 2333 AL 

(ii) TITLE OF INVENTION: Targeted delivery through a cationic 
amino 

acid transporter. 

(iii) NUMBER OF SEQUENCES: 59 

(iv) COMPUTER READABLE FORM; 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 98201693.3 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknow 

(li) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Lys Asn Trp Gin Leu Thr Glu Glu Asp Phe Gly Asn Thr Ser Gly Arg 
is 10 15 

Leu Cys Leu Asn Asn Asp Thr Lys Glu Gly Lys Pro Glv Val Glv Glv 
20 25 30 

Phe 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: '20 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCCTCATAGT TAGCGTAACG 
20 

(2> INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHABACTERISTICS : 

(A) LENGTH: 12 aiaino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

^ Ser Val Ser Val Gly Met Lys Pro Ser Pro Arg Pro 

1 • 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



30 



40 



45 



so 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



35 • (xi) SEQUENCE DE. SCRIPT ION: SEQ ID NO: 4: 

Glu Gin Ser Arg Pro Ser Trp Gin Leu Thr Pro Thr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ IQ NO: 5: 
Gin Thr His Gin Leu Leu Arg Lys Pro Pro Ser Phe 
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15 10 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SfiQOENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANOEOHCSS: unknown 

(D) TOPOLOGY! unknown 

(li) MOLBCOLE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Met His Glu Pro He Thr Pro Asn Pro Val Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STFANDEDNESS: unknown 
ID) TOPOLOGY: unknown 

(iij MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Trp His His He Pro Asn Ser Ala Lys He Ser Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STFANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ser Glu -Asn Leu Thr Leu Met Thr Val Leu Gin Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
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(B) TYPE: amino acid 

(C) STRAMDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(li) MOLECULE TYPE: peptide 
Uii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asn Leu Met Pro Pro Pro Val Pro Arg Leu Pro Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 azaino acids 

(B) TYPE: amino acid 

(C) 5TRAKDEDNES5: unknown 
<D) TOPOLOGY: unknown 

(ii» MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Thr Pro Gin Gly Val His Tyr His Pro Asx Asn Met Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Leu Asn Asn His Thr Thr Pro Pro Ala Trp Asn 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE5S: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; peptide 
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(iiij HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gin Val Val His Ser Pro Phe Pro Thr Set Arg Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(B> TYPE: amino acid 

(C) STRAHDEDNBSS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Phe Glu Gin His Asn Trp Trp Asp Ser His Pro Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 

Asn Thr Phe Asp Leu Trp Leu Gin Ser Val Pro Gin 
15 10 

(2) INFORMATION TOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12. amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1*5: 
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Phe Glu Gly Cys His Pro Gin Ser Gly Leu Ser Cys 
1 5 . .10 

(2) IMFOBMATIOK FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: un3cziown 

(D) TOPOLOGY: un)cnown 

(ii) MOLECULE TYPE; other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 16: 

TTTGAGCAGC ATAATTGGTG GGATTCGCAT CCTCAGCCCC CG6GGCCCCC TTGT 
54 

(2) INFORMATION EX)R SEQ ID NO: 17: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNE5S: unknown 
(0) TOPOLOGY: unXnown 

(ii) MOLECOLE TYPE: other nucleic acid 

<iii) HYPOTHETICAL: NO 



(xi) SEQOEHCE DESCRIPTION: SEQ ID HO: 17: 

CTGAGGATGC GAATCCCACC AATTATGCTG CTCAAAGGAT TGATATTCTA GCCC 
'54 

(2) INFORMATION FOR SEQ ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

Hi) MOLECULE TYPE: other nucleic acid 
<iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TTTGAGCAGC ATAATTGGTG GGATTCGCAT CCTCAGCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) IiKNGTH: 54 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNBSS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTGAGGATGC GAATCCCACC AATTATGCTG CTCAAAAGGT TCTTCGCAGT CTCT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(D| TOPOLOGY: unknown 

(li) MOLECULE TYPE: other nucleic acid 

(lii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AATACTTTTG ATCTTTGGCT GCAGTCTGTT CCTCAGCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE. TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTGAGGAACA GACTGCAGCC AAAGATCAAA AGTATTGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown . 
(0) TOPOLOGY: unknown 
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(11) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 22: 

AATACTTTTG ATCTTTGGCT GCAGTCTGTT CCTCAGCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii> HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CTGAGGAACA GACTGCAGCC AAAGATCAAA AGTATTAGGT TCTTCGCAGT CTCT 
54 

(2) INFORHATION FOR SEQ ID NO: 24: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

TCTGTTTCTG TGGGTATGAA GCCGA6TCCT AGGCCTCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
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{xi> SEQUENCE DESCRIPTION: SEO ID NO: 25: 

AGGCCTAGGA CTCGGCTTCA TACCCACAGA AACAGAGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 26: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCTGTTTCTG TGGGTATGAA GCC6AGTCCT AGGCCTCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 27: 

<i] SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AGGCCTAGGA CTCGGCTTCA TACCCACAGA AACAGAAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
ID) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TTTGAGGG6T GTCATCCTCA 6TCGGGGCTG TCTXGTCCCC CGGGGCCCCC TTGT 
54 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
{D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 29: 

ACAAGACAGC CCCGACTGAG GATGACACCC CTCAAAGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknovm 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: MO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TTTGAGGGGT GTCATOCTCA GTCGGGGCTG TCTT6TCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 31: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ACAAGACAGC CCCGACTGAG* GATGACACCC CTCAAAAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 92 base pairs * 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 



27 



EP 0 959 136 A1 



(D) TOPOLOGY: unknown 
(U) MOLECOLE TYPE: other nucleic acid 
tiii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATCACCTGGG AGGTAACCGG CCATATGTTT 6AGCAGCATA ATTGGTGGGA TTCGCATCCT 
60 

CAGGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR 5EQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGATCTCCAT TGGTTACCAA GCTAGCACCC TGAGGATGCG AATCCCACCA ATTATGCTGC 
60 

TCAAACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 34: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATCACCTGGG AGGTAACCGG* CCATATGAAT ACTTTTGATC TTTGGCTGCA GTCTGTTCCT 
60 

CAGGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) T0P0LCX5Y: unknown 



(ii) MOLECULE TYPE: other nucleic acid 



(iii) HYPOTHETICAL: NO 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CGATCTCCAT TGGTTACCAA GCTAGO^CCC TGAGGAACAG ACTGCAGCCA AAGATCAAAA 
60 



GTATTCATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



ATCACCTGGG AGGTAACCGG CCATATGTCT GTTTCTGTGG GTATGAAGCC (3AGTCCTAGG 
60 

CCTGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2). INFORMATION FOR SEQ 10 NO: 37: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



IS 



30 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



45 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



60 



CGATCTCCAT TGGTTACCAA GCTAGCACCA GGCCTAGGAC TCGGCTTCAT ACCCACAGAA 
60 



ACAGACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 



55 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ATCACCTGGG AGGTAACCG6 CCATAT6TTT GAGGGGTGTC ATCCTCAGTC GGGGCTGTCT 



TGTGGTGCTA GCTTGGTAAC CAATGGAGAT CG 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CGATCTCCAT TGGTTACCAA GCTAGCACCA CAAGACAGCC CCGACTGAGG ATGACACCCC 



TCAAACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Gly His Met Phe Glu Gin His Asn Trp Trp Asp Ser His Pro Gin Gly 
15 10 15 
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Ala Ser Leu Val Thr 
20 

(2) IMFORMATIOM FOR SEQ ID MO: 41; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unJcnowi 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Gly His Met Asn Thr Phe Asp Leu Trp Leu Gin Ser Val Pro Gin Gly 
15 10 15 

Ala Ser Leu VaX Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unytnown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

. Gly His Met Ser Val Ser Val Gly Met Lys Pro Ser Pro Arg Pro Gly 
15 10 15 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPfe: peptide 
(iii) HYPOTHETICAL: NO 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
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Gly His Met Phe GIu Gly Cys His Pro Gin Ser Gly Leu Ser Cys Gly 
1 5 10 15 • 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STEANDEDNESS: unknovm 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

TGAGCAGCAT AATTG6TGGG 
20 

(2) INFORMATION TOR SEQ ID NO: 45: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TTGATCTTTG GCTGCAGTCT 
20 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL.: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TCTG7GGGTA TGAAGCCGAG 
20 



32 



EP 0 959 136 A1 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECOLE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTTGAGGGGT GTCATCCTCA 
20 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPES Other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

6TCCTA6ATT TTGGTATCTG 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(0) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CTCGCTCGCC CATATGCGGC 'CGCAGGTCTC CTCCTCTTAG CAGCACAACC AGCAATGGCC 
60 

GCTTCGCCCG GCTCC 
75 

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 27 base pairs 
(B> TYPE: nucleic acid 

(C) S7RAMDEDNESS: unknown 

(D) TOPOLOGY: unknown • 

(ii) MOLECULE TYPE: other nucleic acid 
. (iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50; 

AGCATCACTA 6TCGCCGGTG GAAGTTG 
27 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQOENCB CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

AAAAACTGGC AGCTCACGGA GGAGGATTTT GGGAACACAT CAGGCCGTCT C 
51 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Lys Asn Trp Gin Leu Thr Glu Glu Asp Phe Gly Asn Thr Ser Gly Arg 
15 10 15 

Leu 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
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(0) TOPOLOGY: un)cnown 
(ii) MOLECULE TYPE: other nucleic acid 
(iil) HYPOTHETICAL: NO 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

T6TTT6AACA ATGACACAAA AGAAGGGAAG CCCGGT6TTG GTGGATTC 
48 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; peptide 
(iii) HYPOTHETICAL: NO 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 54; 

Cys Leu Asn Aan Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly Phe 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS: unlcnown 

(D) TOPOLOGY: unknown 

. (ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Lys Arg Arg Asn Asn Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly 
15 10 15 

Phe Met Pro Phe Gly Phe Ser Gly Val Leu Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown . 

(D) TOPOLOGY: unkno%m 
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(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Lys Arg Arg Asn Asn Asp Thr Asn Val Lys Tyr Gly Glu Gly Gly Phe 
15 10 15 

Met Pro Phe Gly Phe Ser Gly Val Leu Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5925 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: un)cnotm 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 
60 

CTTAGACGTC AGGTGGCACT TTTCG6GGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 
120 

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 
180 

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 
240 

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 
300 

CTGAAGATCA GTTGGGTGCC CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 
360 

TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAAT6AT GAGCACTTTT AAAGTTCTGC 
420 

TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
480 

ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 
540 

GCATGACAGT AA6AGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 
600 

ACTTACTTCT <avCAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 
660 

GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 
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720 

ACGAGCGTGA CACCAC6ATG CCTGTA6CAA 
780 

GCGAACTACT TACTCTAGCT TCCCGGCAAC 
640 

TTGCAGGACC ACTTCTGCGC TCGGCCCTTC 
900 

GAGCCGGTGA GCGT6GGTCT CGCGGTATCA 
960 

CCCGTATCGT AGTTATCTAC ACGACGGGGA 
1020 

AGATCGCTGA GATAGGTGCC TCACTGATTA 
1080 

CATATATACT TTAGATTGAT TTAAAACTTC 
1140 

TCCTTTTTGA TAATCTCATG ACCAAAATCC 
1200 

CAGACCCCGT A6AAAAGATC AAAGGATCTT 
1260 

GCTGCTTGCA AACAAAAAAA CCACCGCTAC 
1320 

TACCAACTCT TTTTCCGAAG GTAACTGGCT 
1380 

TTCTAGTGTA GCCGTAGTTA GGCCACCACT 
1440 

TCGCTCTGCT AATCCTGTTA CCAGTGGCTG 

1500 

GGTTGGACTC AAGACGATAG TTACCGGATA 
' 1560 

CGTGCATACA GCCCAGCTTG GAGCGAACGA 
1620 

AGCATTGAGA AAGCGCCACG CTTCCCGAAG 
1680 

GCAGGGTCGG AACAGGAGAG CGCACGA6GG 
1740 

ATAGTCCTGT CGGGTTTCGC CACCTCTGAC 
1800 

GGGGGCGGAG CCTATGGAAA AACGCCAGCA 
1860 

GCTGGCCTTT TGCTCACATG TTCTTTCCTG 
1920 

TTACCGCCTT TGAfiTGAGCT GATACCGCTC 
1960 

CAGTGAGCGA GGAAGCGGAA GA6CGCCCAA 



TGGCAACAAC GTTGCGCAAA CTATTAACTG 
AATTAATAGA CTGGATGGAG GCGGATAAAG 
CGGCTGGCTG GTTTATTGCT GATAAATCTG 
TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 
GTCAGGCAAC TATGGATGAA CGAAATAGAC 
AGCATTG6TA ACTGTCAGAC CAAGTTTACT 
ATTTTTAATT TAAAAGQATC TAGGTGAAGA 
CTTAACGTGA GTTTTCGTTC CACTGAGCGT 
CTTGAGATCC TTTTTTTCTG CGCGTAATCT 
CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 
TCAGCAGAGC GCAGATACCA AATACTGTCC 
TCAAGAACTC TGTAGCACCG CCTACATACC 
CTGCCAGTGG CGATAAGTCG TGTCTTACCG 
AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 
CCTACACCGA ACTGAGATAC CTACAGCGTG 
6GAGAAAGGC GGACAGGTAT CCGGTAAGCG 
AGCTTCCAGG GGGAAACGCC TGGTATCTTT 
TT6AGCGTC6 ATTTTTGTGA TGCTCGTCAG 
ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 
CGTTATCCCC TGATTCTGTG GATAACCGTA 
GCC6CA6CC6 AACGACCGAG C6CAGCGAGT 
TACGCAAACC GCCTCTCCCC. GCGCGTTGGC 
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2040 

CGATTCATTA ATGCAGCT6G CACGACAGGT 
2100 

ACGCAATTAA TGTGAGTTAG CTCACTCATT 
2160 

CGGCTCGTAT GTTGTGTGGA ATTGTGAGCG 
2220 

ACCATGATTA CGCCAAGCTT TGGAGCCTTT 
22B0 

TTATTCGCAA TTCCTTTAGT TGTTCCTTTC 
2340 

ACGCftGTCTC CAGGCATCCT GTCTTTGTCT 
2400 

GCCAGTCAGA GTGTCAGCAG CAGGAACTTA 
2460 

CCCAGGCTCC TCATCTATGG TGTATCCAAC 
2520 

GGCAGTGGGT CTGGGGCAGA CTTCACTCTC 

2580 

GCGGTGTATT ACTGTCAGCG GTATGGCAGG 
2640 

GTGGAGATCA A2VCGTGGAAC TGTGGCTGCA 
2700 

GAGCAGTTGA AATCTGGAAC TGCCTCTGTT 
2760 

GAGGCCAAAG TACAGTGGAA GGTGGATAAC 
2820 

GTCACAGAGC AGGACAGCAA 6GACAGCACC 
2880 

AAAGCAISACT ACGAGAAACA CAAAGTCTAC 
2940 

TCACCGGTGA CAAAGAGCTT CAACA6GGGA 
3000 

CAAGGAGACA GTCATAAT6A AATACCTATT 
3060 

CGCGGCCCAG CXTGGCCATGG CCCAGGTCCA 
3120 

GCCTGG6AGG TCCCTSAGAC TCTCCTGTGC 
3180 

TATGCACTGG GTCCGCCAGG CTCCAGGCAA 
3240 

TGATGGAAGC AATAAATACT ACGCAGACTC 
3300 

CAATTCCAAG AACACGCT6T ATCTGCAAAT 



TTCCCGACTG GAAAGCGG6C AGTGAGCGCA 
AGGCACCCCA GGCTTTACAC TTTATGCTTC 
GATAACAATT TCACACAGGA AACAGCTATG 
TTTTTGGAGA TTTTCAACGT GAAAAAATTA 
TATTCTCACA GTGCACTTGA AACGACACTC 
CCGGGGGCAG GAGCCACCCT CTCCTGCAGG 
GCCTGGTACC AGCAGAAACC TGGCCAGGCT 
AGGGCCACT6 GCGTCCCAGA CAGGTTCAGT 
ACCATCAACA GACTGGAGCC TGAAGAmT 
TCACTGTGGA CGTTCGGTCA AGGGACCAAG 
CCATCTGTCT tCATCTTCCC GCCATCTGAT 
GTGTGCCTGC TGAATAACTT CTATCCCAGA 
GCCCTCCAAT CGGGTAACTC CCAGGAGAGT 
TACAGCCTCA GCAGCACCCT GACGCTGAGC 
GCCTGCGAAG TCACCCATCA GGGCCTGAGT 
GAGTGTTAAT AAGGCGCGCC AATTCTATTT 
GCCTACGGCA GCCGCTGGAT TGTTATTACT 
GCTG6TGCAG TCTGGGGGAG 6CGTGGTCCA 
AGCCTCTGGA TTCACCTTCA GTAGCTATGC 
GGGGCTGGAG TGGGTGGCAG TTATATCATA 
CGTGAAGGGC CGATTCACCA TCTCCAGAGA 
GAACAGCCra AGAGCTGAGG ACACGGCTGT 
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3360 

GTATTACTGT GCGAGAGGGA TTACAGTAAC 
3420 

CACCCTGGTC ACCGTCTCAA GCGCCTCCAC 
3480 

CTCCTCCAAG AGCACCTCTG GGGGCACAGC 
3540 

CCCCGAACCG GTGACGGTGT CGTGGAACTC 
3600 

CCCGGCTGTC CTACAGTCCT CAGGACTCTA 
3660 

CAGCAGCTTG GGCACCCAGA CCTACATCTG 
3720 

G6TGGACAAG AAAGTTGAGC CCAAATCTTG 
3780 

GGCCGCAGAA CAAAAACTCA TCTCAGAASA 
3840 

AAGTTGTTTA GCAAAACCTC ATACAGAAAA 
3900 

AACTTTAGAT CGTTACGCTA ACTATGAGGG 
3960 

TTGTACTGGT GACGAAACTC AGTGTTACGG 
4020 

tGAAAATGAG GGTGGTGGCt CTGAGGGTGG 
4080 

CGGTACTAAA CCTCCTGAGT ACGGTGATAC 
4140 

TCTCGACGGC ACTTATCCGC CTGGTACTGA 
' 4200 

TGAGGAGTCT CAGCCTCTTA ATACTTTCAT 
4260 

GGGTGCATtA ACT6TTTATA CGGGCACTGT 
4320 

TTACCAGTAC ACTCCTGTAT CATCAAAAGC 
4380 

CAGAGACTGC GCTTTCCATT CTG6CTTTAA 
4440 

CCAATCGTCT GACCTGCCTC AACCTCCTGT 
4500 

TG6TGGCGGC TCTGAGGGTG GCGGCTCTGA 
4560 

GGGTGGCGGT TCCGGTGGCG GCTCCGGTTC 
4620 

CGCTAATAAG GGGGCTATGA CCGAAAATGC 



TAAATCACGA TTTGACTACT GGGGCCAGGG 
CAAGGGCCCA TCGGTCTTCC CCCTGGCACC 
GGCCCTGGGC TGCCTGGTCA AGGACTACTT 
AGGCGCCCTG ACCAGCGGCG TCCACACCTT 
CTCCCTCAGC AGCGTAGTGA CCGTGCCCTC 
CAACGTGAAT CACAAGCCCA GCAACACCAA 
TGCGGCCGCA CATCATCATC ACCATCACGG 
GGATCTGAAT GGGGCCGCAT AGACTGTTGA 
TTCATTTACT AACGTCTGGA AAGAC6ACAA 
CTGTCTGTGG AATGCTACAG GCGTTGTGGT 
TACATGG6TT CCTATTGGGC TTGCTATCCC 
CGGTTCTGAG GGTGGCGGTT CTGAGGGTGG 
ACCTATTCCG GGCTATACTT ATATCAACCC 
GCAAAACCCC GCTAATCCTA ATCCTTCTCT 
GTTTCAGAAT AATAGGTTOC GAAATAG6CA 
TACTCAAGGC ACTGACCCCG TTAAAACTTA 
CATGTATGAC GCTTACTGGA ACGGTAAATT 
TGAGGATCCA TTCGTTTGTG AATATCAAGG 
CAATGCTGGC GGCGGCTCTG GTGGTGGTTC 
GGGTGGCGGT TCTGAGGGTG GCGGCTCTGA 
CGGTGATTTT GATTATGAAA AAATGGCAAA 
C6ATGAAAAC GCGCTACAGT CTGACGCTAA 
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4680 

AGGCAAACTT GATTCTGTCG CTACTGATTA CGGTGCTGCT ATCGATGGTT TCATTGGTGA 
4740 

CGTTTCCGGC CTTGCTAATG GTAATGGTGC TACTGGTGAT TTTGCTGGCT CTAATTCCCA 
4800 

AATGGCTCAA GTCGGTGACG GTGATAATTC ACCTTTAATG AATAATTTCC GTCAATATTT 
4860 

ACCTTCTTTG CCTCAGTCGG TTGAATGTCG CCCTTATGTC TTTGGCGCTG GTAAACCATA 
4920 

TGAATTTTCT ATTGATTGT6 ACAAAATAAA CTTATTCC6T GGTGTCTTTG CGTTTCTTTT 
4980 

ATATGTTGCC ACCTTTATGT ATGTATTTTC GACGTTTGCT AACATACTGC GTAATAAGGA 
5040 

GTCTTAATAA GAATTCACTG GCCGTCGTTT TACAACGTCG TGACIGGGAA AACCCTGGCG 
5100 

TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC CAGCTGGCGT AATAGC6AA6 
5160 

AGGCCCOCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT GAATGGCGAA TGGCGCCTGA 

5220 

TOCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAA ATTGTAAACG 
5280 

TTAATATTTT GTTAARATTC GCGTTARATr TTTGTTAAAT CAGCTCATTT TTTRACCAAT 
5340 

AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA GCCCGAGATA GG6TTGAGTG 
5400 

TTGTTCCAGT TTGGAACAAG AGTCCACTAT TAAAGAACGT G6ACTCCAAC GTCAAAGGGC 
5460 

GAAAAACCGT CTATCAGGGC GATGGCCCAC TACGTGAACC AtCACCCAAA TCAAGTTTTT 
^520 

TGGGGTCGAG GTGCCGTAAA GCACTAAATC GGAACCCTAA A6GGAGCCCC CGATTTAGAG 
5580 

CTTGACGGGG AAAGCCGGC6 AACGT6GCGA GAAAGGAAGG GAAGAAAGCG AAAGGAGCG6 
5640 

GCGCTAGGGC GCTGGCAAGT 6TAGCGGTCA CGCTGCGCGT AACCACCACA CCCGCCGCGC' 
5700 

TTAATGCGCC 6CTACAGGGC GCGTACTATG 6TTGCTTTGA CGGGTGCAGT CTCAGTACAA 
5760 

TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC GCTGACGCGC 
5820 

CCTGACGGGC TXGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC GTCTCCGGGA 
5880 

GCTGCATGTG TCAGA6GTTT TCACCGTCAT CACCGAAACG CGCGA 
5925 

(2) INFORMATION FOR S£Q ID NO: 58: 
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(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 ainino acids . 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unlcnown 

(D) TOPOI/3GY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Val Lys Lys Leu Leu Phe Ala lie Pro Leu Val Val Pro Phe Tyr Ser 
15 10 15 

His ser Ala Leu Glu Thr Thr Leu Thr Gin Ser Pro Gly lie Leu Ser 
20 25 30 

Leu Ser Pro Gly Ala Gly Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser 
35 40 45 

Val Ser Ser Arg Asn Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala 
50 55 60 

Pro Arg Leu Leu lie Tyr Gly Val Ser Aan Arg Ala Thr Gly Val Pro 
65 70 75 80 

Asp Arg Phe Ser Gly Ser Gly Ser Gly Ala Asp Phe Thr Leu Thr lie 
85 90 95 

Asn Arg Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Arg Tyr 
100 105 110 

Gly Arg Ser Leu Trp Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys 
115 120 125 

Arg Gly Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
130 135 140 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
145 150 155 160 

Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu 
165 170 175 

Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp 
180 185 190 

Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr 
195 200 205 

Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 



210 



215 



220 



• Ser Pro 
225 



Val 



Thr Lys 



Ser 
230 



Phe 



Asn Arg Gly Glu 
235 



Cys 



(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: , 
{A) LENGTH: 254 amino acids 
(B) TYPE: amino acid 
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(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide / 
Ciii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
^5 10 15 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Val Gin Ser Gly Gly Gly 
20 25 30 

Val Val Gin Pro Gly Arg Ser Leu Arg Leu Sex Cys Ala Ala Ser Gly 
35 40 45 

Phe Thr Phe Ser Ser Tyr Ala Met His Trp Val Arg Gin Ala Pro Glv 
50 55 60 

Ly» iSly Leu Glu Trp Val Ala Val He Ser Tyr Asp Gly Ser Asn Lys 

65 70 75 80 

Tyr Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn 
85 90 95 

Ser Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 
100 105 110 

Thr Ala Val Tyr Tyr Cys Ala Arg Gly He Thr Val Thr Lys Ser Arg 
115 120 125 

Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser 
130 135 140 

Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr 
145 150 155 160 

Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro 
165 170 175 

Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
180 185 190 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser 
195 200 205 

Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 
210 215 220 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val 
225 230 235 240 

Glu Pro Lys Ser Cys Ala Ala Ala His His His His His His 
245 250 
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Claims 

1 . A virus-iike partici e or gene delivery vehicle provided with a ligand capabi e of binding to a human amino add trans- 
porter. 

2. A virus-like particle or gene delivery vehicle according to claim 1 for delivery of genes to human cells. 

3. A virus-like particle or gene delivery vehicle according to claim 1 or 2 comprising at least one viral protein provided 
with said ligand. 

4. A virus-like particle or gene delivery vehicle according to claim 3 wherein said viral protein comprises an envelope 
protein. 

5. A virus-like particle or gene delivery vehicle according to claim 4 derived from a retrovirus. 

6. A virus-like particle or gene delivery vehicle according to claim 3 wherein said viral protein comprises a capsid pro- 
tein. 

7. A virus-like particle or gene delivery vehicle according to claim 6 derived from an adeno virus. 

8. A virus-like particle or gene delivery vehide according to any one of daims 1 to 7 wherein said amino acid trans- 
porter is a cationic amino acid transporter. 

9. A virus-like particle or gene delivery vehicle according to d£um 8 wherein said transporter is a human cationic 
amino add transporter-1 (hCAT1). 

10. A virus-like partide or gene delivery vehide according to any one of dainr>s 1 to 9 wherein said ligand comprises 
an amino acid sequence selected from Table 2, preferably from the last four different sequences of Table 2 or a 
sequence functionally related thereto. 

11. A virus-like particle or gene delivery vehide according to daim 10 wherein said ligand comprises at least a part of 
the amino acid sequence SVSVGMKPSPRP. 

1 2. A virus-like particle or gene delivery vehicle according to any one of claims 1 to 9 wherein said ligand comprises a 
fragment derived from a phage displaying at least one antibody fragment selected for its capacity to bind with said 
amino add transporter. 

13. A virus-like partide or gene delivery vehk^le according to daim 12 wherein said antibody fragment comprises an 
amino add sequence as shown in Figure 16 or an amino acid sequence functionally equivalent thereta 
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